All Questions
5 questions
1 vote
1 answer
122 views
RL agent for autonomous vehicle is able to follow the road but can't avoid crashing at all (Highway-Env / Racetrack Env.)
I coded some deep RL algorithms (DQN and SAC) with tf2/keras to solve an environment where a vehicle needs to follow the track and avoid crashing into one other vehicle (there is only one other ...
3 votes
0 answers
152 views
Are there Reinforcement Learning algorithms specialized for the case $\gamma=0$?
I have a Reinforcement Learning problem where the optimal policy does not depend on the next state (i.e., $\gamma=0$). I think this means that I only need an efficient exploration algorithm coupled ...
1 vote
0 answers
318 views
How to parallelize multi-agent DDPG (MADDPG)
I am experimenting with the MADDPG algorithm implemented in this repo. Since there were only a few agents (2-3) in the implementation (also in the original paper), steps like parameter updates, action ...
1 vote
0 answers
130 views
How should I simulate this Markov Decision Process?
I am working on a problem in which nodes in a graph communicate with each other. They try to estimate a central state using a Kalman consensus filter, with the connections described by the graph's ...
3 votes
0 answers
66 views
Deep Q-Learning agent performing poor actions. Need help optimizing
I'm trying to make a deep Q-learning agent from https://keon.io/deep-q-learning. My environment looks like this: https://i.sstatic.net/EJHTD.jpg. As you can see, my agent is a circle and there is one ...